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© A high performance data packet buffering meth- 
od and a programmable data communication adapter 
for high speed packet transmission networks are 
disclosed. The line adapter includes programmable 
processing means, for receiving and transmitting 
data packets of fixed or variable length. This system 
is characterized in that it comprises 

• means for buffering (132) said data packets, 

• means for identifying said buffering means and 
said data packets in said buffering means, 

• means for queueing (Figure 15) in storing 
means (131) said identifying means in a single 
instruction, 



• means for dequeueing (Figure 16) from said 
storing (131) means said identifying means in 
another single instruction , 

• means for releasing said buffering means, 
Each instruction comprises up to three oper- 
ations executed in parallel by said processing means 

• an arithmetical and logical (ALU) operation on 
said identifying means, 

• a memory operation on said storing means, 
and 

• a sequence operation. 
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Technical field 

The present invention relates to a high perfor- 
mance packet buffering system and method in a 
programmable data communication adapter of a 
packet switching node of a high speed network. 

Background art 

Technical and Market Trends 

The telecommunication environment is in full 
evolution and has changed considerably this recent 
years. The principal reason has been the spectacu- 
lar progress realized in the communication technol- 
ogy : 

• the maturing of fiber optical transmission. 
High speed rates can now be sustained with 
very low bit error rates. 

• the universal use of digital technologies with- 
in private and public telecommunications net- 
works. 

In relation with these new emerging technol- 
ogies, the offer of the telecommunication com- 
panies, public or private, are evolving : 

• The emergence of high speed transmissions 
entails an explosion in the high bandwidth 
connectivity. 

• the increase of the communication capacity 
generates more attractive tariffs. 

• A higher flexibility is offered to the users to 
manage their growth through a wide range of 
connectivity options , an efficient bandwidth 
management and the support of new media. 

• Once sampled and digitally encoded, voice, 
video and image derived data can be merged 
with pure data for a common and transparent 
transport. 

Abundant, cheap communications means that 
many potential applications that where not possible 
before because of cost are now becoming attrac- 
tive. In this environment, four generic requirements 
are expressed by the users : 

• Doing old applications better, 

• Optimizing communication networks, 

• Doing new applications. 

High Performance Networks 

In a first step, T1 backbone networks were 
primarily deployed with TDM (Time Division Mul- 
tiplexing) technology to achieve cost savings 
through line aggregation. These systems easily 
supported the fixed bandwidth requirements of 
host/terminal computing and 64 Kbps PCM (Pulse 
Code Modulation) voice traffic. 

The data transmission is now evolving with a 
specific focus on applications and by integrating a 
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fundamental shift in the customer traffic profile. 
Driven by the growth of workstations, the local area 
networks (LAN) interconnection, the distributed pro- 
cessing between workstations and super comput- 

5 ers, the new applications and the integration of 
various and often conflicting structures - hierarchi- 
cal versus peer to peer, wide (WAN) versus local 
(LAN) area networks, voice versus data - the data 
profile has become higher in bandwidth, bursting, 

70 non deterministic and requires more connectivity. 
Based on the above, it is clear that there is strong 
requirement to support distributed computing ap- 
plications across high speed backbones that may 
be carrying LAN traffic, voice, video, and traffic 

75 among channel attached hosts, business worksta- 
tions, engineering workstations terminals, and small 
to intermediate file servers. This traffic reflects a 
heterogeneous mix of : 

• end user network protocols including Ether- 
20 net, Tocken Ring, APPN, FDDI, OSI, ISDN, 

ATM .... and 

• real time (steady stream traffic such as voice 
and video) and non real time (bursty nature 
traffic such as interactive data) transmissions. 

25 This vision of a high speed protocol-agile back- 

bone network is the driver for the emergence of 
fast packet switching networks architectures in 
which data, voice, and video information is digitally 
encoded, chopped into small packets and transmit- 

30 ted through a common set of nodes and links. 
Although low speed links may exist, the availability 
of fiber optic links will make cost effective to have 
a few links of high speed rather that many links of 
low speed. In addition to the high speed backbone, 

35 there exists a peripheral network which essentially 
provides access to the switching nodes. This pe- 
ripheral network is composed of relatively low 
speed links which may not use the same protocols 
or switching techniques used in the backbone. In 

40 addition, the peripheral network performs the task 
of multiplexing the relatively slow end users traffic 
to the high speed backbone. Thus, backbone 
switching nodes are principally handling high 
speed lines. The number of high speed links enter- 

45 ing each switching node is relatively small but the 
aggregate throughput very high in the Giga-bits per 
second range. 

Throughput 

50 

The key requirement of these new architec- 
tures is to reduce the end-to-end delay in order to 
satisfy real time delivery constraints and to achieve 
the necessary high nodal throughput for the trans- 
55 port of voice and video. Increases in link speeds 
have not been matched by proportionate increases 
in the processing speeds of communication nodes 
and the fundamental challenge for high speed net- 
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works is to minimize the packet processing time 
within each node. As example, for meeting a typi- 
cal 100 ms delay to deliver a voice packet between 
two end users : 

• A total of 36 ms might be needed for the 
packetization and play-out functions at the 
end points. 

• About 20 ms is the unalterable propagation 
delay needed, say, to cross the United 
States. 

• There remains 44 ms for all the intra-node 
processing time as the packet moves through 
the network. In a 5 nodes network, each node 
would have about 8 ms for all processing 
time including any queueing time. In a 10 
nodes network, each node would have about 
4 ms. 

Another way of looking the same constraint is 
illustrated in Figure 1 : taking a node with an 
effective processing rate of i MIPS (Millions of 
Instructions Per Second), it is possible to fill a 9.6 
kbps line with 1000 byte packets even if a network 
node must execute 833 000 instructions per packet 
processed. For a 64 kbps line the node can afford 
125 000 instructions per packet. In order to fill an 
OC24 link, however, our 1 MIPS node could only 
execute 7 instruction per packet | In the latter case 
even an effective rate of 10-30 MIPS would allow 
only 70-200 instructions per packet. 

In order to minimize the processing time and 
to take full advantage of the high speed/low error 
rate technologies, most of the transport functions 
provided by the new high bandwidth network ar- 
chitectures are performed on an end-to-end basis. 
This includes the flow control and error recovery 
for data, the packetization and reassembly for voice 
and video. The protocol is simplified : 

• First, there is no need for transit node to be 
aware of individual (end user to end user) 
transport connections. 

• Secondly high performance and high quality 
links does not require any more node to node 
error recovery or re-transmission. Congestion 
and and flow control are managed at the 
access and end points of the network con- 
nections reducing both the awareness and 
the function of the intermediate nodes. 

Packet size 

Blocks of user data offered for transmission 
vary widely in size. If these blocks are broken up 
into many short "packets" then the transit delay for 
the whole block across the network will be consid- 
erably shorter, this is because when a block is 
broken up into many short packets, each packet 
can be processed by the network separately and 
the first few packets of a block may be received at 



BNSDOCID: <EP 0632625A1J_> 



the destination before the last packet is transmitted 
by the source. 

Limiting all data traffic to a small maximum 
length also has the effect of smoothing out queue- 
5 ing delays in intermediate nodes and thus provid- 
ing a much more even transit delay characteristic 
than is possible if blocks are allowed to be any 
length. Another benefits to short packets, for exam- 
ple, is to manage easier a pool of fixed length 

w buffers in an intermediate node if it is known that 
each packet will fit in just one buffer and if packets 
are short and delivered at a constant, relatively 
even rate then the amount of storage needed in the 
node buffer pool is minimized. 

75 However, there is a big problem with short 

packet sizes. It is a characteristic of the architec- 
ture of traditional packet switching nodes that 
switching a packet takes a certain amount of time 
or number of instructions regardless of the length 

20 of the packet. That is, a 1000 bytes block requires 
almost the same node resource to switch as does a 
100 bytes block. So if you break a 1000 bytes 
packet up into 10, 100 bytes packets then you 
multiply the load on an intermediate switching node 

25 by 10. This effect was not too critical when nodes 
were very fast and links were very slow. Today, 
when links are very fast and nodes are relatively 
slow, this characteristic is a significant limitation on 
network throughput. 

30 Short packets are more suitable for the trans- 

mission of real time data, like voice or video pack- 
ets, which must be delivered to the receiver at a 
steady, uniform rate (isochronous mode). An ele- 
gant solution to both the transit delay and error 

35 recovery problems is to use very short packets of 
fixed length. Furthermore, this technology simpli- 
fies (and therefore speeds up ) the switching hard- 
ware needed in nodal switches. For example, the 
ATM standard (Asynchronous Transfer Mode) is 

40 using data cell size of 48 bytes (a cell being a 
packet of fixed length), only the routing header (5 
bytes) is checked for validity and the data within 
the cell is left to the end-to-end protocol that 
means to a "higher layer" protocol managed by 

45 the application. 

In another side, pure data are generated in a 
very bursty and non deterministic manner but does 
not have a problem with transit delay. This data 
can be delayed without necessarily degrading the 

so quality of the information presented to an end user 
at the destination. The longer the packet is, the 
fewer packets per second must be switched for a 
given data throughput. However, the non predict- 
able character of the traffic usually requires a 

55 heavy handling of buffers and queues at the ex- 
pense of the response time. 

In order to take full advantage of the different 
data packet transmission systems, the data transfer 

4 
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across the network must be done with packets of 
nearly the same size as the user packets without 
processing them into artificial lengths. As opposed 
to solely data networks or solely voice or video 
networks, the high speed network architectures 
have to support a plurality of heterogeneous trans- 
mission protocols operating with variable length 
packets. 

Connectivity 

In a high speed network, the nodes must pro- 
vide a total connectivity. This includes attachment 
of the customer's devices, regardless of vendor or 
protocol, and the ability to have the end user 
communicate with any other device. Traffic types 
include data, voice, video, fax, graphic, image. The 
node must be able to take advantage of all com- 
mon carrier facilities and to be adaptable to a 
plurality of protocols : all needed conversions must 
be automatic and transparent to the end user. For 
example, a high speed node must not have any 
dependencies on the existence of SNA (System 
Network Architecture) equipments on a user net- 
work. It has to be able to offer a similar level of 
service in a SNA environment as in a non-SNA 
environment made of routers, Private Branch ex- 
changes (PBXs), Local Area Networks (LAN) ... 

Key requirements 

The efficient transport of mixed traffic streams 
on very high speed lines means for each commu- 
nication node of the network a set of requirements 
in term of performance which can be summarized 
as follows : 

• a very short packet processing time, 

• a very high throughput, 

• an efficient queue and buffer management, 

• a limited number of instructions per packet, 

• a very large flexibility to support a wide range 
of connectivity options. 

The high bandwidth dictates the need of spe- 
cialized hardware to support very fast packet han- 
dling and control protocols, and to satisfy the real 
time transmission needs of the voice and video 
traffic. The processing time being the main bot- 
tleneck in high speed networks, most of the com- 
munication nodes today are built around high 
speed switching hardware to off-load the routing 
packet handling and routing functions from the 
processor. 

However, on equal performances, a software 
approach represents the most adequate solution for 
each node to meet the connectivity and flexibility 
requirements and to optimize the manufacturing 
and adaptation costs. The line adapters, are based 
on a common hardware design and are configured 



by means of a specific programming to execute 
either the access point or inter nodal transport 
functions. The adaptability of the adapters to sup- 
port different access protocols and data streams - 

5 Frame Relay, HDLC (High level Data Link Control), 
CBO (Continuous Bit Operations), ATM (Asynchro- 
nous Transfer Mode), ... - is provided by logical 
components called Access Agents. Such logical 
associations Adapter/Access Agent are specified 

io by software, providing a very large flexibility at a 
reduced cost. Each line adapter is automatically 
configured at system initiation according to : 

• the adapter function, and 

• the access protocol. 

75 

Summary of the invention 

In accordance with the present invention, a 
high performance packet buffering method and a 

20 programmable data communication adapter system 
are disclosed. The line adapter includes program- 
mable processing means, for receiving and trans- 
mitting data packets of fixed or variable length. 
This system is characterized in that it comprises 

25 • means for buffering (132) said data packets, 

• means for identifying said data packets in 
said buffering means, 

• means for queueing (Figure 15) in storing 
means (131) said identifying means in a sin- 

30 gle instruction, 

• means for dequeueing (Figure 16) from said 
storing (131) means said identifying means in 
another single instruction , 

• means for releasing said buffering means, 

35 Each instruction comprises up to three oper- 

ations executed in parallel by said processing 
means : 

• an arithmetical and logical (ALU) operation on 
said identifying means, 

40 • a memory operation on said storing means, 
and 

• a sequence operation. 
Description of the drawings 

45 

Figure 1 shows the processing times (or num- 
ber of instructions per second) required in function 
of the different line throughputs supported by the 
present invention, 
so Figure 2 shows a typical model of high speed 
packet switching network including the access and 
transit nodes claimed in the present invention. 

Figure 3 describes a high speed Routing Point 
according to the present invention. 
55 Figure 4 shows a programmable high perfor- 

mance adapter as claimed in the present invention. 

Figure 5 represents the receive and transmit 
data flows in a Trunk adapter according to the 
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present invention 

Figure 6 illustrates the buffer, packet and 
queue structures according to the present inven- 
tion. 

Figure 7 illustrates the packet segmentation 
mechanism according to the present invention. 

Figure 8 represents the Buffer Pointer structure 
according to the present invention. 

Figure 9 represents the List Pointer structure 
according to the present invention. 

Figure 10 illustrates the List IN operation ac- 
cording to the present invention. 

Figure 11 illustrates the List_OUT operation 
according to the present invention. 

Figure 12 represents the Free Buffer List struc- 
ture according to the present invention. 

Figure 13 represents the Processor functional 
structure according to the present invention. 

Figure 14 illustrates the dequeueing mecha- 
nism according to the present invention. 

Figure 15 illustrates the queueing mechanism 
according to the present invention. 

Figure 16 shows a simplified view of a de- 
queueing cycle according to the present invention. 

Description of the preferred embodiment of the 
invention 

As illustrated in Figure 2, a typical model of 
communication system is made of several user 
networks (212) communicating through a high per- 
formance network (200) using private lines, carrier 
provided services, or public data networks. Each 
user network can be described as a set of commu- 
nication processors and links (211) interconnecting 
large computers used as Enterprise Servers (213), 
user groups using workstations or personnel com- 
puters attached on LAN (Local Area Networks 214), 
applications servers (215), PBX (Private Branch 
exchange 216) or video servers (217). These user 
networks, dispersed in different establishments, 
need to be interconnected through wide area trans- 
port facilities and different approaches can be used 
for organizing the data transfer. Some architectures 
involve the checking for data integrity at each net- 
work node, thus slowing down the transmission. 
Others are essentially looking for a high speed data 
transfer and to that end the transmission, routing 
and switching techniques within the nodes are op- 
timized to process the flowing packets towards 
their final destination at the highest possible rate. 
The present invention belongs essentially to the 
latter category and more particularly to the fast 
packet switching network architecture detailed in 
the following paragraphs. 



High speed packet switching networks 

The general view in Figure 2 shows a fast 
packet switching transmission system comprising 

5 eight nodes (201 to 208) each node being intercon- 
nected by means of high speed communication 
lines called Trunks (209). The access (210) to the 
high speed network by the users is realized 
through Access Nodes (202 to 205) located at the 

w periphery. These Access Nodes comprise one or 
more Ports, each one providing an access point for 
attaching external devices supporting standard in- 
terfaces to the network and performing the conver- 
sions required to transport the users data flow 

75 across the network from and to other external de- 
vices. As example, the Access Node 202 interfaces 
respectively a Private Branch exchange (PBX), an 
application server and a hub through three Ports 
and communicates through the network by means 

20 of the adjacent Transit Nodes 201 , 208 and 205. 

Switching Nodes 

Each network node (201 to 208) includes a 
25 Routing Point where the incoming data packets are 
selectively routed on the outgoing Trunks towards 
the neighboring Transit Nodes. Such routing de- 
cisions are made according to the information con- 
tained in the header of the data packets. In addition 
30 to the basic packet routing function, the network 
nodes also provide ancillary services such as : 

• the determination of routing paths for packets 
originated in the node, 

• directory services like retrieving and updating 
35 information about network users and re- 
sources, 

• the maintaining of a consistent view of the 
physical network topology, including link utili- 
zation information, and 

40 • the reservation of resources at access points 
of the network. 
Each Port is connected to a plurality of user 
processing equipments, each user equipment com- 
prising either a source of digital data to be trans- 

45 mitted to another user system, or a data sink for 
consuming digital data received from another user 
system, or, typically, both. The interpretation of the 
users protocols, the translation of the users data 
into packets formatted appropriately for their trans- 

50 mission on the packet network (200) and the gen- 
eration of a header to route these packets are 
executed by an Access Agent running in the Port. 
This header is made of Control and Routing Fields. 

• The Routing Fields contain all the information 
55 necessary to route the packet through the 

network (200) to the destination End Node to 
which it is addressed. 
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• The Control Fields include, among other 
things, an encoded identification of the pro- 
tocol to be used in interpreting the Routing 
Field . 

Routing Points 

Figure 3 shows a general block diagram of a 
typical Routing Point (300) such as it can be found 
in the network Nodes (201 to 208) illustrated in 
Figure 2. A Routing Point comprises a high speed 
packet Switch (302) onto which packets arriving at 
the Routing Point are entered. Such packets are 
received : 

• from other nodes over high speed transmis- 
sion links (303) via Trunk Adapters (304). 

• from users via application adapters called 
Ports (301). 

Using information in the packet header, the 
adapters (304, 301) determine which packets are to 
be routed by means of the Switch (302) towards a 
local user network (307) or towards a transmission 
link (303) leaving the Node. The adapters (301 and 
304) include queuing circuits for queuing packets 
prior to or subsequent to their launch on the Switch 
(302). 

The Route Controller (305) calculates the opti- 
mum routes through the network (200) so as to 
minimize the amount of network resources used to 
complete a communication path and builds the 
header of the packets generated in the Routing 
Point. The optimization criteria includes the char- 
acteristics of the connection request, the capabil- 
ities and the utilization of the Trunks in the path, 
the number of intermediate nodes ... All the in- 
formation necessary for the routing, about the 
nodes and transmission links connected to the 
nodes, are contained in a Network Topology 
Database (306). Under steady state conditions, ev- 
ery Routing Point has the same view of the net- 
work. The network topology information is updated 
when new links are activated or new nodes added 
to the network. Such information is exchanged by 
means of control messages with all other Route 
Controllers to provide the necessary up-to-date in- 
formation needed for route calculation (such 
database updates are carried on packets very simi- 
lar to the data packets between end users of the 
network). The fact that the network topology is kept 
current in every node through continuous updates 
allows dynamic network reconfigurations without 
disrupting end users logical sessions 
The incoming transmission links to the packet 
Routing Point may comprise links from external 
devices in the local user networks (210) or links 
(Trunks) from adjacent network nodes (209). In any 
case, the Routing Point operates in the same man- 
ner to receive each data packet and forward it on 



to another Routing Point as dictated by the in- 
formation in the packet header. The fast packet 
switching network operates to enable a commu- 
nication between any two end user applications 

5 without dedicating any transmission or node facili- 
ties to that communication path except for the 
duration of a single packet. In this way, the utiliza- 
tion of the communication facilities of the packet 
network is optimized to carry significantly more 

w traffic than would be possible with dedicated trans- 
mission links for each communication path. 

Ports and Trunk Adapters 

75 Adapters Function 

Ports are located at the boundary of the high 
speed network. They allow terminal equipments to 
exchange information through the high speed net- 
20 work without the need for knowing the specific high 
speed protocol used. The main function of the 
Ports are : 

• receiving foreign protocol data units from an 
external resource and forwarding them as 

25 high speed packets over the network to a 

target Port, and 

• converting high speed packets back to for- 
eign protocol data units and sending them to 
the target resource, 

30 • controlling the bandwidth. 

Note : the source and target Ports may be 
located in the same node. 

Trunks are the links between the high speed 
network nodes. They carry high speed packets. 
35 Each Trunk manage its link bandwidth and link 
status. A critical task of the Trunks is the manage- 
ment of traffic priorities and allocation of internal 
buffers to reduce delay and congestion. 

In addition, there is a special type of adapter 
40 called Route Controller Adapter (305) which : 

• communicates with the other adapters (301, 
304) through the Switch (302), 

• implements the centralized functions of the 
Route Controller such as the topology, the 

45 path selection .... 

• establishes, maintains and cancels end-to- 
end high speed connections. 

Adapters Architecture 

50 

Several techniques for designing said Ports, 
Trunk and Route Controller Adapters, to obtain 
more or less flexible and efficient transmission sys- 
tems. Most of the adapters today are built around a 
55 specialized hardware depending on the function 
and protocol of the connected links. 

The present invention, to satisfy the previously 
enumerated connectivity and flexibility require- 
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ments, provides a software solution based on a 
common hardware structure. Port and Trunk Adapt- 
ers present the same architecture and their func- 
tional differentiation is realized through a specific 
programming. However, even using the most effi- 
cient general purpose microprocessor today avail- 
able on the market, the experience shows that it is 
very difficult to reach the desired level of perfor- 
mance in term of number of switched packet per 
second. This is the reason why the control of each 
adapter has been shared between two processors : 
a Specific Purpose Processors (SPP, 406, 412), 
optimized for the packet switching and a General 
Purpose Processor (GPP, 409), the first dealing 
with the packet to be switched, the critical process- 
ing in term of performance, and the second with 
the adapter management. 

As shown in Figure 4, each adapter (400) com- 
prises the following logic components : 

1 A General Purpose Processor (GPP, 409) 
whose programming depends of the selected 
Port or Trunk Adapter function. The GPP imple- 
ments the adapter control operations. 
2. A Receive Adapter (401) for implementing 
three functions : 

• the checking of the high speed packets 
header. 

• the traffic discrimination according to the 
routing mode specified in the header of 
every incoming packet, 

• the routing of the incoming packets 
through the Switch (403) with the appro- 
priate header. 

The Receive Adapter includes : 

a. a Line Receiver (407) for handling the 
data movements between a Line Interface 
(415) and a Receive Buffer Memory (RBM, 
405). 

b. a Receive Buffer Memory (RBM, 405) to 
temporarily store users data packets. 

c. a Receive Specific Purpose Processor 

(RSPP, 406) based on a specialized micro- 
processor comprising a Local Memory (LM, 
408). The RSPP handles the received steady 
state packet flow and forwards the control 
packets to the General Purpose Processor 
(409). 

d. a Local Memory (LM, 408) used by the 
RSPP (406) as work space. 

e. a Switch Transmitter Adapter (404) for 

- handling the data flow transferred from 
the buffer Memory (RBM, 405) under 
the control of the Receive Specific Pur- 
pose Processor (406) 

- segmenting this flow in fixed length 
cells and, 

- generating an appropriate switch rout- 
ing header 
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3. a Transmit Adapter (402) for implementing 
the following functions : 

• the reception of the data flow from the 
Switch (403), 

5 • the checking of the cells header. 

• the reassembly in packets (Port), 

• the Trunk functions (Trunk adapter), 

• the routing. 

The Transmit Adapter includes : 
w a. a Switch Receiver (410) for handling the 

flow coming from the Switch (403) and trans- 
ferring it to the Buffer Memory for reassem- 
bly. 

b. a Transmit Specific Purpose Processor 

is (XSPP, 412) similar to the Receive Specific 

Purpose Processor (406). The XSPP handles 
the steady state data and forwards the control 
flow to the General Purpose Processor (GPP, 
409). 

2 o c. a Line Transmitter Adapter (413) for 

handling the data movements between the 
Buffer Memory (411) and the Line Interface 
(415). 

The adapters are connected on one side on 
25 the packet Switch and on the other side on the 
Line Interfaces : 

• The Line Interfaces (415) are used to ad- 
apt the Port and Trunk Adapter physical 
interfaces to the appropriate media. 

30 • The packet Switch (302, 403) allows the 

Route Controller (305) and the different 
Ports (301), Trunk Adapters (304) to com- 
municate. 

35 Data Flow Control 

The receive and transmit data flows in the 
Trunk Adapters are represented in Figure 5. In a 
proprietary high speed network with packets of 
40 variable lengths, the receive process involves the 
steps of : 

1. Line Receiver, Buffer Memory, Specific Pur- 
pose Processor (501) system 

a. receiving the packets from the line, 
45 b. checking the packets header and in case 

of error discarding the packets, 

c. processing the information contained in the 
packets header according the routing mode, 

d. routing the control messages towards the 
50 General Purpose Processor (GPP, 502) 

e. encapsulating the packets with a specific 
switch header in function of the destination 
adapter, 

f. forwarding the packets and the GPP (502) 
55 control messages to the Switch Transmitter 

Adapter (504), 

2. Switch Transmitter Adapter (504) 
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a. segmenting the packets in cells of fixed 
length adapted to the Switch (503), 

b. generating an error check field to ensure 
the integrity, of the switch header during the 
transmission of the cells over the Switch 
(503). 

The transmit process, as for it, comprises the 
steps of : 

1 . Switch Receiver Adapter (505) 

a. receiving the cells from the Switch (503), 

b. checking the switch header and, in case of 
error, discarding the cell, 

2. Line Receiver, Buffer Memory, Specific Pur- 
pose Processor (506) system 

a. reassembling the data packets, 

b. forwarding the control packets to the Gen- 
eral Purpose Processor (502), 

c. encapsulating the packets with a routing 
header, 

d. receiving control packets from the GPP 
(502), 

e. queueing data and control packets in the 
appropriate queues, 

f. handling the outgoing packets with priority 
given to real time data (and then to non real 
time data). 

It is possible to design the adapters to work 
either in a a proprietary environment with packets 
of variable length, or in a standard mode such as 
ATM (Asynchronous Transmission Mode) with 
short cells of fixed length, or in both where appro- 
priate. In this last case, for performance purpose, 
cells routed on the switch are identical or similar to 
these defined in the ATM protocol with as result : 

• the elimination of the packets segmentation 
and reassembly steps in the Switch Transmit- 
ter (508) and Receiver Adapters (509), 

• a simplification of the switch header process- 
ing in the Specific Purpose Processor (507, 
510). 

Adapter Functional Structure 

The present invention deals with the relation- 
ships between the Line Receiver/Transmitter, the 
Buffer Memory, the Specific Purpose Processor, 
and the Switch Adapter and in particular with the 
handling of the data flow in a way to optimize the 
throughput and the processing time inside the 
adapters. More specifically, the invention relates to 
a very high performance system for queueing and 
dequeueing the data buffers. 

The communication adapters are based on the 
following principles : 

• the Specific Purpose Processor is designed 
minimize the number of operations necessary 
to manage the steady state data flow. 



• data packets and control data are managed 
separately in two distinct memories respec- 
tively the Buffer Memory and the Local Mem- 
ory. 

5 • the data packets buffering and the queueing 
and dequeuing mechanisms are identical for 
all Ports-Trunk and Receive-Transmit adapt- 
ers. 

According to these considerations, the follow- 
w ing conventions will be used to describe the inven- 
tion : 

The device which reads in the Buffer Memory 
and the device which writes into the Buffer Memory 
are designated respectively by 101 and 102. That 
75 means : 

• in the receive side of the adapter, lOI = 
Switch Transmitter and 102 = Line Receiver 

• in the transmit side of the adapter, 101 = 
Line Transmitter and 102 = Switch Receiver. 

20 In the same way : 

• an input data stream goes from the switch or 
external line (102) to the Buffer Memory. 

• an output data stream goes from the Buffer 
Memory to the switch or external line (101). 

25 Furthermore : 

The meaning of "packet" is application depen- 
dent. It may be applied, for example, to an SDLC 
frame from a Port, to a proprietary packet format 
from a Trunk, or to a cell received from an ATM 

30 Trunk. The term "packet" that will be used in the 
following paragraphs will not refer to a precise data 
unit format. 

Data Structures 

35 

Buffers, Packets, Queues Structures 

Data packets are stored in the Buffer Memory 
(BM) while control data are managed directly by 

40 the Specific Purpose Processor (SPP) in the Local 
Memory (LM). The basic unit of memory that can 
be allocated to an input (Line/Switch Receiver 
(I02)) or output device ( Line/Switch Transmitter 
(101)) is a buffer of fixed length. As illustrated in 

45 Figure 6, each of these buffers is represented in 
the Local Memory by a pointer called Buffer Point- 
er (B PTR). A pointer is a generic term to identify 

a logical data structure (buffer, packet, queue ...) 
stored in the Buffer Memory. The storage of a 

50 packet requires one or several buffers. These buff- 
ers are chained together using a list of pointers 
(B_LIST) which is itself represented by a Packet 
Pointer (P_PTR). A list of Packet Pointers 
(P_LIST), identified by a Queue Pointer (Q_PTR), 

55 designates a queue of several packets. 
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List Prefix 

Each list, representing a specific packet or a 
queue structure, is preceded by a Prefix used for 
storing any type of information related to the data 
the structure contains. In Buffer Lists, the Prefix 
(B_PREFIX) contains information related to the 
routing of the packet : 

• the packet header 

• the date of the packet reception 

• the packet length 

All the processor operations on the packet 
header are realized in the the Local Memory (LM) 
without having to access to the data stored in the 
Buffer Memory (BM). Furthermore, when the pro- 
cessor (SPP) is working on the Local Memory, the 
DMA operations on the Buffer Memory are not 
disrupted. The result is a more efficient routing 
process and memory management. 

Packet Segmentation 

To facilitate the memory management, the lists 
used for packets and queues are of fixed length. 
As represented in Figure 7, packets which are 
bigger than the buffer list can contain, are seg- 
mented. This method allows the lists (B_LIST) not 
to be sized at the maximum packet length. The 
segments (B_LIST 1 , B_LIST 2) are identified in 
the packet list (P_LIST) by means of distinct pack- 
et pointers (P_PTR 1 , P_PTR 2). The correlation 
between segments of a same packet is realized 
through a specific Status Field (SF) located in the 

last Buffer Pointer (B PTR) of each segment. This 

field is flagged EOS (End Of Segment) when the 
List is full. In an output operation, 

• If Status Field is found equal to End Of 
Segment (EOS), the next pointer in the pack- 
et list (P_JJST) will identify another segment 
of the same data packet. 

• If Status Field is found equal to End Of 
Packet (EOP), the next pointer in the packet 
list (P_JJST) will identify another data pack- 
et. 

Buffer Pointers 

The Figure 8 describes the general format of a 
Buffer Pointer (801). Buffers (800) need not to be 
full and the data (DDDDDDDD) may start (A) and 
end (B) at any place. For example, it is possible to 
reserve the first bytes of a buffer to include a 
header a posteriori. The data writing and reading is 
realized by means of five fields located in the 
Buffer Pointer (801 ) : 

• the First Element Pointer (FEP) : identification 
of the first data element in the buffer (data 
may start at any place), 



• the Last Element Pointer (LEP) : identification 
of the last data element in the buffer (data 
may end at any place), 

• the Status Field (SF) : this field is used in the 
5 last Buffer Pointer of a list to designate either 

an End Of Segment (EOS) or an End Of 
Packet (EOP), 

• the Buffer IDentifier (BID) : identification of 
the buffer containing the data, 

io • the Current Element Pointer (CEP) : iden- 
tification of the current data element to read 
or to write. 

The status of a Buffer Pointer before the data 
acquisition is such that the Current Element Pointer 

15 (CEP) is set equal to the First Element Pointer 
(FEP = CEP = A) (802). The Current Element 
Pointer is incremented each time a new element is 
stored. At the end of the acquisition, the Current 
Element Pointer designates the last data (CEP = 

20 B) (803). For a subsequent processing, the Current 
Element Pointer value (CEP = B) is then writen in 
place of the Last Element Pointer (LEP) and after- 
wards reset to its initial value (CEP = FEP = A) 
(804). This method allows the transmission of the 

25 data in the order in which they were received. 

List Pointers 

Referring to Figure 9, the List Pointer format 
30 consists of three fields : 

• the List IDentifier (LID) : identification of the 
list, 

• the HEAD : identification of the first pointer of 
the list, 

35 • the TAIL : identification of the next pointer to 
attach to the list. 

Free Buffer List (FBL) 

40 The management of the Buffer Memory is re- 

alized by means of a specific list called Free Buffer 
List (FBL). The FBL comprises the totality of the 
Buffer Pointers and its role is to provide a status of 
the memory occupation (Figure 12) using the 

45 HEAD and TAIL fields of the Free Buffer List Point- 
er (P FBL) : 

• T : total number of buffers in the Buffer 
Memory. 

• HEAD : identification of the first free buffer of 
so the list. Each time a new buffer is filled, the 

HEAD field is incremented. 

• TAIL : identification of the next free buffer of 
the list. Each time a new buffer is released, 
the TAIL field is incremented. 

55 • HEAD = TAIL : the Buffer Memory is full. 

• Incremented TAIL = HEAD : the Buffer 
Memory is empty. 
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The Free Buffer List (FBL) is created at initi- 
ation time and is permanent contrary to other lists 
which are created dynamically (Buffer, Packet or 
Queue Lists). 

Note : in general, when a lack of resources is 
detected (Free Buffer List empty), then the packet 
which cannot be stored in the Buffer Memory is 
discarded. 

Specific Purpose Processor Structure 

The Specific Purpose Processor functional 
structure is illustrated in Figure 1 3. 

Processor Parallel processing 

The Specific Purpose Processor is designed to 
execute up to three operations in parallel : 

1. ALU (Arithmetical and Logical Unit, 140) oper- 
ations on registers 

2. Memory operations 

3. Sequence operations 

The parallelism requires to distinguish instruc- 
tions from operations : 

• Instruction : it is the content of the code 
word. In term of assembler language, the 
instruction corresponds to one line of code. 
All instructions are executed in one processor 
cycle 

• Operation : an instruction may be composed 
of one or more operations which are ex- 
ecuted simultaneously. 

Memory Space 

The SPP memory space is divided in three 
blocks : 

• the Instruction Store (130), 

• The Local Memory (LM, 131), which is the 
code working space, 

• The Buffer Memory (BM, 132) which is the 
repository for data packets when they pass 
through the adapter. 

They all operate in parallel, the Instruction 
Store (130) under control of the Sequencer (133), 
the Local Memory under control of the processor 
code and the Buffer Memory (132) under the con- 
trol of the Direct Access Memory (DMA, 134)). 

Registers 

The registers are divided in two categories : 

1. the General Purpose Registers (GPR) 

These registers are located in the Register 
File (RF, 135) and are available as instruction 
operands. 

2. the Control Registers (CR) 

The CR's are hardware registers which are 



used in specific functions and are available also 
as instruction operands. However, there are less 
degree of freedom in their use, as compared 
with the GPR's. In particular, two of these con- 
5 trol registers (136) are located in the Direct 
Access Memory (DMA, 134). 

• CR1 = D__PTR1 (DMA Pointer 101) 

• CR2 = D__PTR2 (DMA Pointer I02) 
DMA Pointers 1 and 2 are associated to 

w input/output 101 (137) and I02 (138) and they 
both contain the current Buffer Pointer 
(B PTR). 

Memory Address Generator (MAG, 139) 

15 

In all load or store operations, on the Local or 
on the Buffer Memory, the physical address is 
reconstituted from the different fields of Buffer or 
List Pointer, used as operand. For performance 
20 reason, this operation is realized by a specialized 
hardware component called Memory Address Gen- 
erator (MAG, 139). 

Direct Memory Access Controller (DMA, 134)) 

25 

The use of a Direct Memory Access Controller 
(DMA, 134) jointly with a processor is well-known in 
the state of the art. Its role is to quickly move the 
data packets between the 10 devices (137, 138) 

30 and the Buffer Memory (1 32) without the processor 
(SPP) intervention. The DMA module consists of 
two independent programmable channels. The 10 
devices present their service requests (SR1, SR2) 
to the DMA which controls the access to the Buffer 

35 Memory (132). The processor intervention is need- 
ed only at buffer and packet boundaries. The data 
streams between the two 10 devices and the Buffer 
Memory is processed in parallel with the code. Up 
two IO operations can be multiplexed on the BMIO 

40 bus; one with 101 and the other with I02. For that 
the DMA manages the two DMA Pointers 
(D_PTR1 and D_PTR2) which are nothing else 
than Buffer Pointers. 

45 Input Output Subsystems 

• The Specific Purpose Processor (SPP) is 
considered as the "master" and it establishes 
the connections. 

50 • The IO devices and the Buffer Memory are 
controlled either directly by the processor 
code, or via the DMA. 

• The code intervention can be forced by the 
IO devices via the Interrupt mechanism in 

55 case of buffer or packet boundary. 



11 



JDOCID: <EP 0632625A1_I_> 



19 



EP 0 632 625 A1 



20 



Data Processing 

Various processing can be made on Buffer and 
List Pointers : 

• Incrementing a Buffer Pointer; 5 

• Closing a buffer, 

• Accessing a List Prefix, 

• Attaching an element to a list, 

• Detaching an element from a list. 

10 

Operations on Pointers 

Some operations on pointers are performed by 
the processor code, others by the Direct Memory 
Access (DMA). 75 

• The writing and reading of the Buffer Pointers 
is exclusively the fact of the DMA (134). 

■ The writing in the Buffer Memory : 

At the reception of a service request 
(SR2) from an I02, the DMA has access to 20 
the Buffer Memory (1 32) by means of the 
address contained in the Pointer 2 
(D_PTR2, 136). The DMA Pointer 2 is 
provided by the processor (SPP) and is 
nothing else than a Buffer Pointer 25 
(B__PTR). The DMA orders simultaneously 
the I02 (138) to present a data element on 
the BMIO bus and the Buffer Memory 
(132) to write this data element at the 
Current Element Pointer CEP) position in 30 
the buffer identified by the BID (Buffer 
IDentifier) field in the DMA Pointer. Data 
are filled into the buffer starting an ad- 
dress chosen by the code and ending at 
the bottom of the buffer except the last 35 
buffer (of a packet) where data may end at 
any place. Before starting the input opera- 
tion, the FEP (First Element Pointer) and 
the CEP (Current Element Pointer) fields 
in DMA Pointer are both on the first data 40 
position. There is no alignment constraints. 
The Current Pointer (CEP) is incremented 
each time new data element is stored. 
When the buffer is full, the DMA demands 
from the processor a new Buffer Pointer 45 
through an interrupt mechanism 
(I02_E0B routine). A similar procedure is 
used when the I02 detects the end of a 
packet (I02_E0P routine)). After the data 
acquisition, the Current Element Pointer 50 
(CEP) is reset to the first data position 
(CEP = FEP). 

■ The Reading in the Buffer Memory : 

At the reception of a service request 
(SR1) from an 101, the DMA addresses the 55 
Buffer Memory (132) by means of the BID 
field (Buffer IDentifier) contained in the 
DMA Pointer 1 (D_PTR1, 136). The DMA 



Pointer 1 is provided by the processor 
(SPP). The DMA orders simultaneously 
the Buffer Memory (132) to present on the 
BMIO bus a data element at the Current 
Element Pointer position in the buffer iden- 
tified by the BID field of the DMA Pointer 
and the 101 device (138) to read this data 
element. Before starting the output opera- 
tion, the FEP (First Element Pointer) and 
the CEP (Current Element Pointer) fields 
in DMA Pointer are both on the first data 
position. The Current Pointer (CEP) is in- 
cremented each time new data element is 
transferred. The buffer is empty when the 
Current Element Pointer value is equal to 
the Last Element Pointer value. At that 
time, the DMA demands a new Buffer 
Pointer from the processor through an in- 
terrupt mechanism (I01_E0B routine). A 
similar procedure is used when the DMA 
detects a end of a segment or packet 
(I01_E0S or I01__E0P routine). After the 
data transfer, the Buffer Pointer is released 
in the Free Buffer List, Packet List and 
Queue List are updated accordingly. 
• Packet and Queue Pointers are managed by 
the processor code : 

■ the code intervention is forced by the IO 
devices and DMA via the Interrupt mecha- 
nism in case of buffer or packet boundary. 

■ the buffer and packet queueing and de- 
queuing mechanisms are executed under 
the control of the processor in the Local 
Memory. 

Interrupts 

The Interrupt mechanism is the way real time 
is supported by the Specific Purpose Processor 
(SPP). An Interrupt is a break, due to particular 
events, in the normal sequence of the processor 
code. The events which can cause this Interrupt 
are the service requests from the IO devices asso- 
ciated with specific conditions such as end of buff- 
er, end of packet ... At each specific Interrupt 
corresponds a specific routine which cannot be 
interrupted by another routine. It is very important 
to have interrupt routines as short as possible to 
avoid overrun/underrun problems. 

1 . The following Interrupts are used by the Line 
and Switch Transmitters (101). 
• I01_E0B: 

Condition : when serving an output 
101, the DMA has emptied a buffer (which 
is not the last buffer of the packet/segment 
- the Buffer Pointer is not flagged EOS or 
EOP) the DMA Pointer (D_PTR1) raises a 
I01__E0B interrupt which triggers a 
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I01_E0B routine. 

Routine : this routine 

■ releases the pointer of the just emptied 
buffer in the Free Buffer List. A new 
pointer is dequeued from the Output 5 
Buffer List (OB_LIST) and passed to 

the DMA Pointer (D_PTR1 ). 

• l01_EOP : 

Condition : when serving an output 
. 101, the DMA has emptied a packet (the w 
last buffer of a packet which pointer con- 
tains the EOP flag ON), the DMA Pointer 
(D__PTR1) raises a I01_E0P interrupt 
which triggers a I01_E0P routine. 

Routine : the routine : 75 

■ releases the current and last Buffer 
Pointer of the Output Buffer List 
(0B_LIST) in the Free Buffer List 
(FBL). 

■ detaches the current Output Packet 20 
Pointer (OP_PTR) from the Output 
Packet List (OP_LIST). 

■ dequeues the next packet and its point- 
er from the Output Packet List 

• I01_E0S: 25 

Condition : when serving an output 

101, the DMA has emptied a segment - 
(the last buffer of a packet segment which 
pointer contains the EOS flag ON and the 
EOP flag OFF), the DMA Pointer 30 
(D_PTR1) raises a I01__EOS interrupt 
which triggers a I01_E0S routine. 

Routine : this routine : 

■ releases the pointer of the just emptied 
EOS buffer in the Free Buffer List 35 
(FBL). 

■ detaches the pointer of the just emptied 
packet in the Output Packet List. 

■ dequeues the pointer of the next seg- 
ment from the Output Packet List. 40 

2. These Interrupts are used by the Line and the 
Switch Receiver (102). 

• I02_E0B: 

Condition : when serving an input 

102, the DMA has detected a Buffer Full 45 
condition (That buffer is not the last buffer 

of a packet), the DMA Pointer (D_PTR2) 
raises a I02__EOB interrupt which triggers 
a I02__E0B routine. 

Routine this routine : 50 

■ stores the pointer of the just filled up 
buffer in a pre-allocated Input Buffer 
List (IB_J_IST) where all buffers of the 
same packet are chained. 

■ updates the Input Buffer List prefix area 55 
(IBL_PREFIX). 

■ provides the DMA Pointer (D_PTR2) 
with a new Buffer Pointer for continuing 



the reception of the data of the same 
packet. Free Buffer Pointers are man- 
aged by the Free Buffer List (FBL). 

■ when for the same packet, the Buffer 
List is full, the segmentation takes 
place. 

• l02_EOP: 

Condition ; when serving an input 
102, the DMA has completed the recep- 
tion of a packet (the last buffer of the 
packet which pointer is flagged EOM by 
the hardware), the DMA Pointer raises a 

102 EOP interrupt which triggers a 

102 EOP routine. The code intervention 

is required to provide a new Buffer Pointer 
for the reception of a new packet. 
Routine : this routine : 

■ stores the pointer of the last filled buffer 
in a pre-allocated Input Buffer List 
(IB_LIST) where all buffers of the 
same packet are automatically chained. 
The last Buffer Pointer of a packet is 
flagged EOP by the DMA Pointer 
(D__PTR2). 

■ updates the Input Buffer List prefix area 
(IBL_PREFIX). 

■ the Packet Pointer of the Input Buffer 
List is queued in the Input Packet List 
(IP_LIST). 

■ provides the DMA Pointer (D_PTR2) 

with a new Buffer Pointer (B PTR) for 

the reception of the next packet. 

Queuing and Dequeuing Operations 

The process of attaching and detaching, object 
of the present invention, involves the following op- 
erations : 

• LIN RA,RT : List IN 

• LOUT RA.RT : List OUT 

• LLEH RA.RT : Load List Element HEAD 

• STLET RA.RB : Store List Element TAIL 

• GOTO (COND.) ADDR : Go to an immediate 
ADDRess 

ALU operations on registers 

1.LIN RA,RT (Figure 10) 

The LIN (List IN) operation increments the 
TAIL field of the List Pointer referenced by RA 
and stores the result into the register referenced 
by RT. If the list is full, the RA contents is 
stored into RT without change. LIN is usually 
used in conjunction with the STLET memory 
operation to attach an element to a list in the 
TAIL position. In that case RT = RA. The list full 
indicator (F) is updated as follows : 

IF INCREMENTED (TAIL) = HEAD then set 
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F = 1 (List full) else set F = 0 
A list full condition detected on the LIN opera- 
tion does not prevent the execution of the 
STLET. However the list integrity is not de- 
stroyed. 

Example : let's consider a 16 element list 
(elements of 4 bytes) located at address 
'001 800\ It contains six elements, its HEAD 
address is '001820', its TAIL address is 
'001838'; so its List Pointer is '00182038' with 
(as example) : 

• 2 bytes : List IDentifier (0018) 

• 1 byte : HEAD (20) 

• 1 byte : TAIL (38) 

(0 a 1 bits : list size 6 bits spare (0), 8 a 23 LID, 
24 a 27 HEAD, 28 a 31 TAIL, LID sur 12 bits + 
4 bits a 0). It is supposed to be held in R20. The 
operation LIN R20,R20 returns '00182030'. If a 
second operation LIN R20,R20 was performed 
(assuming no element has been detached by a 
LOUT operation in the mean time), it would 
return '00182040*. 
2. LOUT RA,RT (Figure 11) 

The LOUT (List OUT) operation increments 
the HEAD field of the List Pointer referenced by 
RA and stores the result into the register re- 
ferenced by RT. If the list is empty, that means 
if the HEAD is equal to the TAIL field, the RA 
contents is unchanged. The list empty indicator 
(E) is updated as follows : 
IF TAIL = HEAD then set E = 1 (List empty) 
else set E = 0 

LOUT is usually used in conjunction with the 
LLEH memory operation to detach the HEAD 
element of a list. In that case RT = RA. 

Example : let's consider a 16 element list 
located at address '003400' containing 5 ele- 
ments. Its HEAD address is '003438' and its 
TAIL address is '00341 C\ So its List Pointer is 
'00343810'. It is supposed to be held in R30. 
The operation LOUT R30.R30 returns 
'00343C1C'. If a second operation LOUT 
R30,R30 was performed (assuming no element 
has been attached by a LIN operation in the 
mean time), it would return '0034401 C\ 

Memory Operations 

1. LLEH RA,RT 

The LLEH (Load List Element Head) opera- 
tion moves the HEAD element from the list 
referenced by RA to the RT register. LLEH is 
usually used in conjunction with LOUT operation 
to detach an element (Buffer or List Pointer) 
from the HEAD of a list. When used in conjunc- 
tion LOUT, RT must not be equal to RA and the 
register pointed by RA must be the same as 
that of LOUT. 



Example : let's consider a 16 element list 
located at address '003400* containing 5 ele- 
ments. Its HEAD address is '003438' and its 
TAIL address is '00341 C\ So its list pointer is 
5 '0034381 C\ It is supposed to be held in R30. 
The operation LLEH R30,R40 returns the con- 
tents of the Local Memory location '003438' into 
R40. 

2. STLET RA,RB 

10 The STLET (Store List Element Tail) opera- 

tion stores the element referenced by RB into 
the list referenced by RA at the TAIL position 
RA and RB are List Pointers. 

STLET is usually used in conjunction with 

75 LIN (List IN) to attach an element to the TAIL of 
a list. 

Example, let's consider a 16 elements list 
located at address '003400' containing 5 ele- 
ments. Its HEAD address is '003438* and its 

20 TAIL address is '00341 C*. So its List Pointer is 
'0034381 C\ It is supposed to be held in R30. 
The operation STLET R30.R40 writes the con- 
tents of R40 into Local Memory location 
■00341 C\ 

25 Note : 

a. STLET can be used, without LIN, to modify 
the pointer of the last buffer of a packet. 

b. If an element is attached to a full list, the 
list full condition (F = 1) which is detected 

30 by the LIN operation, does not prevent the 

execution of the STLET operation. However, 
the store alters a non significant list element. 

Sequence operations 

35 

The condition indicators set by the LIN and 
LOUT operations : 

• E List Empty 

• F List Full 

40 reflect a processor state which can be tested in the 
same instruction or later by a conditional sequence 
operation. 

1. GOTO (COND,)ADDR 

If the tested condition (if any) is met, the 
45 new code address is given by the ADDR 
operand. If not the code proceeds in sequence. 

Typical Instructions 

50 • The following instruction allows to detach an 
element (Buffer or List Pointer) from the 
HEAD of a list, the List Pointer being given 
by RA (Figure 14) : 

LOUT RA,RA LLEH RA,RB 

55 The LOUT (ALU operation 140) incre- 

ments the HEAD field of the List Pointer 
referenced by RA (Register File 141) (The 
RA register is updated with the new HEAD 
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value at the next clock signal). Simultaneous- 
ly, the HEAD element of the list referenced 
by RA is moved from the Local Memory 
(142) to the register RB (141) by means of 
the memory operation LLEH. 

• The following instruction allows to attach an 
element (Buffer or List Pointer) given by RB 
to the TAIL of a list, the List Pointer being 
given by RA (Figure 15) : 

LIN RA.RA STLET RA.RB 

The LIN (ALU operation 150) increments 
the TAIL field of the List Pointer referenced 
by RA (Register File 151). (The RA register is 
updated with the new TAIL value at the next 
clock signal). Simultaneously, the TAIL ele- 
ment of the list referenced by RA is moved 
from the register RB to the Local Memory 
(152) by means of the memory operation 
STLET. 

• The following instruction allows to release a 
buffer (its pointer being given by RB) to the 
TAIL of the Free Buffer List, the pointer of 
which is given by RA : 

LIN RA.RA STLET RA.RB 

Queueing & Dequeueing Cycles 

Figure 16 shows a simplified view of the de- 
queueing cycle. As detailed in previously, the de- 
tach instruction requires two operations (LOUT and 
LLEH) which are processed in parallel. Within the 
same processor cycle (160, 161), the contents of 
the RA register (162) used as input operand is 
applied simultaneously to the ALU and to the Local 
Memory by means of the U Bus (163). The ALU 
increments the HEAD field of the pointer. The 
register RA (162) and RB (165) are updated re- 
spectively with the new pointer value and the 
HEAD element at the clock signal (166). The 
queueing process is similar. 

Performance 



time and this processing time is directly related to 
the number of instructions required to handle a 
packet. The operations on the packets can be 
divided in two categories : 
5 • the buffering process with the interrupt rou- 
tines. This operation is generic for all adapter 
types. The processing time required for this 
operation is directly proportional to the pack- 
et length. 

10 • the background process with the operations 
of routing, assembling - disassembling, for- 
matting of the packets , management of the 
bandwidth, priority, ... This process is de- 
signed according to the adapter function, but 

75 in the same application, the background pro- 

cess is identical for all packets independently 
of their size. 

The interrupt routines are the way real time is 
supported by the processor. They must be as short 

20 as possible to avoid the overrun on the input de- 
vice (101) and the underrun on the output device 
(101). They commands the performance of the 
adapter in term data throughput (bits per second). 
For packets of infinite length, the background pro- 

25 cess disappears and the throughput bottleneck is 
the DMA. The DMA response time, which depends 
of the interrupt response time, is optimized by 
means of the queuing and dequeueing operations 
objects of the present invention. 

30 In another way, to maximize the packet 

throughput (number of packets per seconds that 
the adapter is able to transmit), the number of 
instructions required by the background process 
must be reduced to a minimum. As in the previous 

35 case, the queueing and dequeueing operations are 
optimized by means of the present invention (only 
one processor cycle or instruction). Furthermore, 
the particular data organization between the dif- 
ferent memory entities contributes to an efficient 

40 manipulation of the packets. 

Claims 



The throughput of a communication adapter is 
defined as the total time required to handle a data 
packet from the input to the output. However, the 
packet size being application dependent, two mea- 
sures are currently used to evaluate the perfor- 
mances of adapters : 

• first, the number of packets of fixed length 
the adapter is able to handle in a second 
(packet throughput), 

• second, the number of bits per second the 
adapter is able to transmit in case of infinite 
packet length (data throughput). 

The performances depend on the hardware 
and on the processor capabilities but the main 
throughput limiting factor is the packet processing 



1. A line adapter (401, 402) for a packet switching 
45 node (300) in a communication network (200), 

including programmable processing means 
(SPP) for receiving (138) and transmitting (137) 
data packets of fixed or variable length, char- 
acterized in that it further comprises : 
50 • means for buffering (132) said data pack- 

ets, 

• means for identifying said data packets 
in said buffering means, 

• means for queueing (Figure 15) said 
55 identifying means in storing means (131) 

in a single instruction, 

• means for dequeueing (Figure 16) from 
said storing (131) means said identifying 
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means in another single instruction , 

• means for releasing said buffering 
means, 

each of said queueing and dequeueing 
instructions comprising up to three operations 5 
executed in parallel by said processing means 

• an arithmetical and logical (ALU) opera- 
tion on said identifying means, 

• a memory operation on said storing 10 
means, and 

• a sequence operation. 

2. The programmable adapter according to claim 

1 wherein : 15 

• said buffering means includes means for 
writing and reading said data packets in 
buffers (800) of fixed length under control 
of a direct memory access module (134), 

• said storing means includes means for 20 
separately storing (131) said identifying 
means under control of said processing 
means (SPP), 

3. The programmable adapter according to claim 25 

2 wherein said identifying means includes : 

• buffer pointers (B PTR) identifying said 

buffers and chained in one or more buff- 
er lists (B_LIST), 

• packet pointers (P_PTR) identifying said 30 
buffer lists (B__LIST) and chained in one 

or more packet lists (P_LIST), 

• queue pointers (Q__PTR) indentifying 
said packet lists (P_LIST) and chained 

in one or more queue lists (Q__LIST). 35 
and wherein each list comprises a prefix 
for storing any type of information related to 
the data the list contains. 

4. The programmable adapter according to claim 40 

3 wherein said buffer list prefix includes : 

control and routing information contained 
in the data packet header. 

5. The programmable adapter according to any- 45 
one of claims 3 to 4 wherein each list pointer 
comprises : 

• a first field for identifying a pointer list 
(LID). 

• a second field for identifying the next 50 
pointer (TAIL) to attach to said pointer 

list, 

• a third field for identifying the first pointer 
(HEAD) in said pointer list. 

55 

6. The programmable adapter according to any- 
one of claims 3 to 5 wherein each buffer 
pointer comprises : 



• a first field for identifying a buffer (BID), 

• a second field for identifying a first data 
element in said buffer (FEP), 

• a third field for identifying a last data 
element in said buffer (LEP), 

• a fourth field for identifying a current data 
element in said buffer CEP), and 

• a fifth field used as status indicator (SF). 

7. The programmable adapter according to claim 
6 wherein : 

• buffer lists are of fixed length, 

• buffer pointers can be chained in a plu- 
rality of buffer lists when the data packet 
to store is bigger than a single buffer list 
can contain, 

• said status indicator is flagged in in the 
last buffer pointer of each buffer list en- 
suring a correlation between the different 
lists of said data packet. 

8. The programmable adapter according to 
claims 5 to 7 wherein said queueing instruction 
comprises means : 

• for incrementing the TAIL field of a list 
pointer, 

• for simultaneously storing the pointer 
identified by the TAIL field in the pointer 
list identified by the LID field, 

• for generating list empty (E) or list full (F) 
indicators. 

9. The programmable adapter according to 
claims 5 to 8 wherein said dequeueing instruc- 
tion comprises means : 

• for incrementing the HEAD field of a list 
pointer, 

• for simultaneously reading the pointer 
identified by the HEAD field, in the point- 
er list identified by the LID field, 

• for generating list empty (E) or list full (F) 
indicators. 

10. The programmable adapter according anyone 
of claims 8 to 9 wherein said instruction in- 
cludes a further means : 

• for testing said list empty (E) or list full 
(F) indicators. 

11. The programmable adapter according to any- 
one of claims 3 to 10 wherein the management 
of the data packet buffering means is realized 
by means of a permanent list (Free Buffer List) 
containing the totality of the buffer pointers. 

12. The programmable adapter according to any- 
one of preceding claims wherein said process- 
ing means includes : 
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• an arithmetical and logical unit, 

• a register file, 

• a sequencer, 

• an instruction file, 

• a direct access memory module, 5 

• a physical memory address generator. 

13. A method for queueing and dequeueing data 
packets of fixed or variable length in a line 
adapter (401, 402) of a packet switching node jo 
(300), said line adapter including programma- 
ble processing means (SPP), buffering (132) 

and storing means (131) for receiving (138) 
and transmitting (137) data packets of fixed or 
variable length, said method characterized in 75 
that it comprises the steps of : 

• buffering (132) said data packets in said 
buffering means, 

• defining identifying means and identify- 
ing said buffering means and said data 20 
packets in said buffering means, 

• queueing (Rgure 15) in storing means 
(131) said identifying means in a single 
instruction, 

• dequeueing (Figure 16) from said storing 25 
(131) means said identifying means in 
another single instruction , 

each of said queueing and dequeueing 
instructions comprising up to three operations 
executed in parallel by said processing means 30 

• an arithmetical and logical (ALU) opera- 
tion on said identifying means, 

• a memory operation on said storing 
means, and 35 

• a sequence operation. 

14. The method according to claim 13 wherein : 

• said buffering means includes means for 
writing and reading said data packets in aq 
buffers (800) of fixed length under control 

of a direct memory access module (134), 

• said storing means includes means for 
separately storing (131) said identifying 
means under control of said processing 45 
means (SPP), 

15. The method according to claim 14 wherein 
said identifying means includes: 

• buffer pointers (B__PTR) identifying said 50 
buffers and chained in one or more buff- 
er lists (B_LIST), 

• packet pointers (P_PTR) identifying said 
buffer lists (B_LIST) and chained in one 

or more packet lists (P LIST), 55 

• queue pointers (Q__PTR) indentifying 
said packet lists (P__LIST) and chained 
in one or more queue lists (Q_LIST). 



and wherein each list comprises a prefix 
for storing any type of information related to 
the data the list contains. 

16. The method adapter according to claim 15 
wherein said buffer list prefix includes : 

control and routing information contained 
in the data packet header. 

17. The method according to anyone of claims 15 
to 16 wherein each list pointer comprises : 

• a first field for identifying a pointer list 
(LID). 

• a second field for identifying the next 
pointer (TAIL) to attach to said pointer 
list, 

• a third field for identifying the first pointer 
(HEAD) in said pointer list. 

18. The method according to anyone of claims 15 
to 17 wherein each buffer pointer comprises : 

• a first field for identifying a buffer (BID), 

• a second field for identifying a first data 
element in said buffer (FEP), 

• a third field for identifying a last data 
element in said buffer (LEP), 

• a fourth field for identifying a current data 
element in said buffer CEP), and 

• a fifth field used as status indicator (SF). 

19. The method according to claim 18 wherein : 

• buffer lists are of fixed length, 

• buffer pointers can be chained in a plu- 
rality of buffer lists when the data packet 
to store is bigger than a single buffer list 
can contain, 

• said status indicator is flagged in in the 
last buffer pointer of each buffer list en- 
suring a correlation between the different 
lists of said data packet. 

20. The method according to claims 17 to 19 
wherein said queueing instruction comprises 
the steps of: 

• incrementing the TAIL field of a list point- 
er, 

• simultaneously storing the pointer iden- 
tified by the TAIL field in the pointer list 
identified by the LID field, 

• generating list empty (E) or list full (F) 
indicators. 

21. The method according to claims 17 to 20 
wherein said dequeueing instruction comprises 
the steps of: 

• incrementing the HEAD field of a list 
pointer, 
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• simultaneously reading the pointer iden- 
tified by the HEAD field, in the pointer 
list identified by the LID field, 

• generating list empty (E) or list full (F) 
indicators. 



22. The method according to claims 20 to 21 
wherein said instruction includes the further 
step of: 

• testing said list empty (E) or list full (F) 10 
indicators. 
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